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Abstract: As the leading cause of dementia worldwide, Alzheimer's disease afflicts 
millions, with progressively impaired abilities to carry out daily activities or 
communicate and even recognize faces. Although the cause behind lupus is not fully 
understood, it probably reflects lifestyle choices and environmental factors as well as 
genetic propensity. The largest obstacles in the diagnosis of these diseases are their 
often subtle early manifestations and absence of sensitive detection paradigms. Deep- 
learning algorithms first came to the forefront of medical imaging just a few years ago 
and were celebrated as sophisticated diagnostic aids, able to spot subtle signs in scans 
usually hidden from human eyes. We are benefitting from the use of these state-of- 
the-art algorithms to improve Alzheimer's detection, with one of the largest MRI 
datasets available today (more than 86,000 images) being used to train our model. In 
view of this vast data set, it was appreciably combined one to be accurate-centric 
diagnostic tool. The performance of our novel deep learning model is strong and 
provided state-of-the-art validation accuracy (99.63%), surpassing existing models 
These figures highlight the great promise of our model as a verifiable method for 
detecting early-stage Alzheimer's disease - a significant concern in controlling and 
managing disease progression. Our research truly is a major step forward in the field 
of Alzheimer's disease diagnosis by employing cutting-edge deep learning techniques. 
Early diagnosis allows for better treatment and lower disease burden that can prevent 
morbidity, mortality and even change many patient outcomes. This is a considerable 
improvement toward diagnosing Alzheimer's disease with the help of artificial 
intelligence and presents an expectation for more exact and timely finding. 


Introduction 

Alzheimer’s disease is a condition that has an impact 
on the brain, leading to memory issues and changes in 
behaviour. It mainly impacts individuals making it 
challenging for them to perform tasks and causes a loss of 
independence. The presence of protein deposits like 
tangles and amyloid plaques in the brain disturbs 
communication between neurons and leads to the death of 
brain cells, which are key indicators of Alzheimer’s 


condition. The precise reason behind Alzheimer’s disease 
is still a mystery. Finding a cure remains challenging, 
hence, early detection and precise diagnosis play a role in 
managing and treating the condition despite research 
efforts in this field (Madhu et al., 2024; Biswas et al., 
2024; Saha et al., 2024). Studies suggest that Alzheimer’s 
disease is a contributor to the mortality rate among 
though the 
uncertain. Moreover, there is a lack of confirmation 


Americans, exact percentage remains 
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through autopsies citing Alzheimer’s as the main cause, 
making it harder to accurately assess the disease's impact 
(Alzheimer’s Association, 2024). 

Advances in technology have greatly impacted 
diagnosis in the realm of neurology. Convolutional neural 
networks (CNNs), a form of deep learning technology, 
show potential in aiding the diagnosis of Alzheimer’s 
disease. Through the analysis of medical imaging data, 
like PET scans and MRIs, deep learning algorithms can 
identify patterns and irregularities associated with 
Alzheimer’s disease with accuracy. 

Deep learning algorithms offer an effective way to 
diagnose compared to traditional methods that may 
overlook minute variations (Rao et al., 2022). Our study 
includes the use of standard CNN models such as 
ResNet50 (He et al., 2023; Hossain et al., 2022), VGG16 
(Pora et al., 2023), Xception (Chollet, 2017; Gülmez, 
2022), and a custom Deep CNN model. Custom model 
demonstrated better performance in terms of accuracy 
and precision when detecting Alzheimer’s disease 
compared to standard models. 


Related work 

According to Beheshti et al. (2017) early prediction of 
AD and the manner of changing MCI to AD with 
structural MRI scans is achieved through a CAD system. 
The system employs a feature-rating and genetic set of 
rules-based technique. It accomplished 93.01% accuracy 
for AD class and 75% accuracy for MCI conversion 
prediction. This CAD method has the capability to have 
an influence over conventional diagnostic strategies. 
(2020) used 2D-DCNN to classify 
Alzheimer's disease based on MRI images, with a focus 
on imbalanced classes. The proposed model achieves a 


Nawaz et al. 


remarkable 99.89% accuracy in classifying MRI scans 
into three categories: AD, MCI, and normal control. This 
approach simplifies the network by converting 3D MRI 
scans into 2D slices and handles class imbalances without 
requiring data augmentation. Traditionally, imbalanced 
datasets can pose a significant challenge for machine 
learning models. 

Bringas et al. (2020) collected data from 35 patients 
having Alzheimer’s disease by using smartphones for a 
week. The proposed approach used a CNN model to 
recognize patterns in the time series data. The model 
achieved an accuracy of 90.01%, improving traditional 
feature-based classification. Hasan et al. (2021) method 
leverages a generative network with deep convolution for 
data augmentation, fine-tuned VGG16 architecture, and 
support vector machine (SVM). The proposed method 
outperforms existing 


approaches, achieving high 
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accuracy and precision in diagnosing Alzheimer's disease 
at its initial stages. The combined utilization of DCGAN, 
VGG16, and SVM displays the potential for new 
approaches to enhance the field of medical analysis and 
contributes to the ongoing efforts to combat Alzheimer's 
disease. 

Raju et al. (2021) used transfer learning with VGG16 
to perform multi-class classification of Alzheimer’s 
disease using MRI scans. This approach achieved an 
accuracy of 99% by using pre-trained models, reducing 
the need for vast training data. "Conditional Triplet 
Network" is used by Orouskhani et al. (2022) in their 
research to improve the accuracy of AD diagnosis using 
brain MRI scans. The model uses a conditional triplet that 
addresses the limited data problem, and it is based on a 
modified VGG16. The model achieved remarkable 
results, outperforming existing models, with 99.41% 
accuracy for classification. 

Study by Saleem et al. (2022) explores the DL 
techniques for Alzheimer's diagnosis. It talks about 
different datasets for AD diagnosis, with an emphasis on 
deep learning techniques such as CNNs and FCNNs. In 
the pursuit of a more precise and effective diagnosis of 
AD, the review notes that CNNs are the most widely used 
DL technique and highlights persistent issues like 
overfitting and data quality. Al Shehri (2022) used 
Densenet-169 and Resnet-50 to classify Alzheimer’s 
disease into four sub classes. The Denset-169 attained 
test accuracy of 83.82 while Resnet-50 attained test 
accuracy of 81.92, outperforming in the training and 
testing phases. 

Using 2D Tl-weighted MRI brain images, EL- 
Geneedy et al. (2023) presented a DL technique for 
Alzheimer's disease. The pipeline produced testing 
accuracy of 99.68% in differentiating between various 
stages of AD and normal cognition. It per-forms better 
than other deep learning models, demonstrating its 
promise for accurate and timely diagnosis of AD. 
Alsubaie et al. (2024) review paper emphasizes on deep 
learning algorithms potential and limitations in 
Alzheimer’s disease diagnosis. This study explores 
various convolutional neural networks, recurrent neural 
networks, and generative models to evaluate AD 
classification. 

Rao et al. (2023) speak about the effective methods 
for identifying medical leaves by hybridizing Gaussian 
Mixture Model (GMM) and Convolutional Neural 
Networks (CNN), which boosts accuracy in leave 
classification and validates significant raises over 
(2023) 
discovers Machine learning methods that are used for 


traditional methods. Yavanamandha et al. 
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gesture recognition to enable communication for deaf and 
dumb people. It includes CNN for extracting features, 
RNN to handle temporal dependencies and SVM for 
classification tasks. Rao et al. (2023) presents an 
automated system for classifying Ayurvedic plants using 
machine learning and also pulls Convolutional Neural 
Networks (CNNs) to enrich the accuracy of plant species 
classification and recognition, addressing challenges in 
former methods that fell short in accurate classification 
and categorization. Mishra et al. (2023) discuss the 
Machine learning methods for classifying skin diseases 
using Convolutional Neural Networks (CNNs) and 
Support Vector Machines (SVMs) that show efficiency 
accuracy and call into question of these machine learning 
techniques in dermatological diseases. 
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based solver and tenfold cross-validation are used to 
enrich heart disease prediction. This improved machine 
learning model is appraised using the UCI Repository 
dataset with 90.16% superior results (Narisetty et al., 
2023). Afzal et al. (2019) recommended a brand-new 
model for addressing class imbalances. The author 
introduced a method that associates class balancing 
techniques with feature developments by signifying 
improved presentation in machine learning tasks with 
imbalanced datasets, which shows far-reaching results 
using Convolution Neural networks (Keerthana et al., 
2023). 

Yildirim and Cinar (2020) developed a hybrid CNN 
model via Resnet50 as a core model to diagnose 


Alzheimer's disease stage classification that classify the 
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Figure 1. Inception V3 Architecture. 
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Figure 2. Alexnet Architecture. 


Kompalli et al. (2023), with the help of a tool that is 
designed to support colour-blind people by using the 
image processing technique openCV, indicates the name 
RGB(Red, Green, Blue)values and helps people to 
discriminate and shades. Logical 
regression in line with hyperparameter tuning via grid- 


identify several 
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various stages of the disease. Khvostikov et al. (2018) 
emphases on Alzheimer disease medical image analysis 
and proposes a Convolution neural network (CNN) 
algorithm that rages sMRI and DTI modalities on the 
hippocampal ROI, using data from the ADNI database. 
The suggested results stand out when compared with 
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many other approaches and this study also introduced 
data argumentative for class balancing. Basheera and 
Satya Sai Ram (2020) objective of the study is to depict a 
model that differentiates. Mild Cognitive Impairment 
(MCI) and Cognitive Normal (CN) that is used to predict 
early diagnosis of the disease. The model has been tested 
with 10 fold cross validation that outperforms with 98% 
accuracy. 


c)Non-demented 


Risk of Overfitting: AlexNet is prone to overfitting, 
particularly when working with small datasets. 


Materials and Methods 
Dataset 

Deep Convolutional Neural Networks are a type of 
network specifically created for tasks that involve 
analyzing images and spatial data (Zhang et al., 2018). 


d)Very mild 


Figure 3. Alzheimer’s disease dataset. 


Existing System 
InceptionV3 

A DCNN architecture called InceptionV3 (Guan et al., 
2019) is well-known for its 
classification accuracy (figure 1). 


exceptional image 
Challenges: 

Complexity: Each Inception module contains multiple 
parallel convolutional operations, making the architecture 
somewhat complex. 

Resource-intensive: InceptionV3 is less appropriate 
for real-time applications since it necessitates a large 
amount of computational power. 

Alexnet 

AlexNet (Alom et al., 2018) can automatically extract 
hierarchical features from raw image data because it has 
several convolutional and pooling layers (figure 2). 


Challenges: 

Computational Intensity: The depth and abundance of 
parameters in AlexNet make it computationally 
demanding. Real-time application training and 
deployment of  AlexNet can be difficult. 
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These networks have the ability to identify patterns, in 
data by detecting features in input images. By going 
through layers, these networks can abstract information 
more and more while also capturing spatial relationships 
through convolution processes. 

The significance of Deep CNNs in the field of 
intelligence is evident through the accomplishments of 
architectures like VGG16, ResNet50 and Xception across 
tasks. In particular, Deep CNNs play a role in examining 
MRI data to detect AD at a stage. These networks 
automatically extract patterns and characteristics from 
brain images. 

The “Oasis Alzheimer’s Dataset” (Aithal, 2023) on 
Kaggle comprises of over 86,000 images divided into 
four categories, shown in Figure 3: ‘Very Mild 
Dementia’, ‘Mild Dementia’, ‘Moderate Dementia’ and 
‘Non Demented’. This dataset is useful for training 
models due to its coverage of age groups and spatial 
the The inclusion of 
understanding of 


arrangements in 
subcategories also 


images. 
offers an 
Alzheimer’s disease, aiding in diagnosis and influencing 
treatment plans. 
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Data preprocessing 

During preprocessing of the images from the "Oasis 
Alzheimer’s Dataset" are being readied for training a 
learning model. The file paths of each image sourced 
from the subclasses are combined with their classes. 
Following that, a one-hot encoding method is utilized to 
represent the labels of these subclasses. Each subclass 
receives a value, and the encoder is adjusted accordingly 
to reflect this mapping. Subsequently, the images are 
resized to 128x128 pixels and transformed into a NumPy 
array. If an image meets the size criteria of 128x128, it is 
included in the data set along with its corresponding one 
label, in label set. 


Batch Normalization 
Max Pooling(2,2) 


Conv-1 
32 Filters, ReLU 
Input shape - 128, 128 


64 Filters, ReLU 


Very Mild Demented - 3 


Encoding 


prepare it for input into fully connected layers. Two 
connected layers with 256 and 128 units, respectively are 
added, both employing ReLU activation functions. L2 
regularization (refer with: Equation 2) is employed with a 
penalty value of 0.001 on these layers weights to prevent 
overfitting. 

J(8) = = YI, Cost(he(x), y) += X7- 16)! 

(2) 

L2 regularization does not mandate the model’s 
coefficients to be zero, rather it prefers them to be small. 
It prevents overfitting by utilizing multiple features. 


Dropout regularization, with a rate of 0.2 is 
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Figure 4. Proposed system architecture. 


DCNN model 

The model follows a step-by-step approach, stacking 
layers one on top of the other, consisting of a total of 15 
layers, as shown in Figure 4. The primary layer consists 
of ReLU activation (refer with: Equation 1) and 32 filters 
sized at (3, 3). This layer processes input data in the form 
of (128, 128 3) representing images with dimensions of 
128 by 128 pixels and three RGB color channels. 


f(x) = max (0, x) () 


ReLU and its derivative are both the same. If the 
function gets an input of negative values, it returns 0. 
Otherwise, it returns the same value when it is positive. 
Due to this, the range of ReLU function is from zero to 
infinite. 

Batch normalization is applied after each 
convolutional layer to standardize the input data and 
enhance the efficiency of network training. Max pooling 
with a size of (2, 2) is utilized to reduce the dimensions 
of the feature maps identifying details within the images. 
The output from the layers is flattened into one array to 
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implemented after each layer to mitigate overfitting risks 
by deactivating 20% of neurons during training sessions. 
The final each 
representing a class and providing probabilities, for each 
class through SoftMax activation. 


layer consists of four neurons, 


Evaluation metrics 

We evaluated the performance of each method using 
metrics like accuracy, loss, f1 score, and precision to 
conclude which implemented model is better. 


Result and Discussion 

The dataset was utilized with models like ResNet50, 
VGGI16 and Xception, as shown in Figure 5. Table 1 
depicts the experiment results for Accuracy, Loss, 
Precision, and F-score. VGG16 stands out for its 
accuracy of 92.03% and depth and image categorization 
capabilities. Similarly, Xception excelled with an 
accuracy of 93.93% due to its inception modules 
adeptness at capturing features across scales. ResNetS0, a 


learning architecture achieved an impressive accuracy 


score of 97.50%. On the other hand, our customized Deep 
CNN surpassed all models with an exceptional accuracy model’s accuracy changes during training and validation. 
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The accuracy graph in Figure 6 illustrates how the 


rate of 99.63%. The validation accuracy curve aligns closely with the 
Table 1. Experiment Results for Accuracy, Loss, Precision and F-score. 
Model Validation Accuracy Validation Loss Precision F1 score 
Resnet50 97.50 10.31 97.53 97.50 
VGG16 92.03 20.93 92.39 92.03 
Xception 93.93 25.56 94.46 93.93 
DCNN 99.63 8.59 99.75 99.63 
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Figure 5. Accuracy comparison among all models. 
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Figure 6. Training and validation accuracy graph for DCNN model. 
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training accuracy even though there are few drops. This 
suggests that the model is effectively adapting to unseen 
data meaning the model is understanding complex 
patterns. 


achieves an AUC ROC of 1, whereas a random classifier 


is associated with an AUC ROC of 0.5. 


he confusion matrix in Figure 9 represents how a 


classification model works by comparing the predicted 
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Figure 7. Training and validation loss graph for DCNN model. 
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Figure 8. ROC curve for DCNN model. 


The loss curve in Figure 7 illustrates the changes in 
the model’s loss during training and validation epochs. 
The training loss shows the difference between the 
expected values and the model’s predictions. There are 
times when the validation loss curve shows some spikes, 
but it stays in line with the training loss curve. This 
suggests that the model isn't memorizing noise in the data 
and overfitting; instead, it’s focusing on learning 


meaningful patterns and representations. 
The ROC curves area gives us a measure of the 
model’s performance in Figure 8. An ideal classifier 
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values with actual values in the training dataset. 

Similarly, the training and validation accuracy graph 
for Resnet50, Figure 10, shows that the model is not 
generalized to unseen data. The model memorized noise 
in the data rather than the patterns hence, overfitting 
occurred. 

In the training and validation loss graph for Resnet50, 
Figure 11, there is a sudden peak indicating that the 
model's performance on the training data has declined 
abruptly. It could be due to various reasons like an 
unstable learning rate or other issues during training. 
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Figure 9. Confusion matrix for DCNN model. 
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Figure 10. Training and validation accuracy graph for Resnet50. 


Training and Validation Loss 


—— Train Loss 
—— Validation Loss 


0 2 4 6 8 
Epochs 


Figure 11. Training and validation loss graph for Resnet50. 
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Figure 12(a). Training and validation accuracy graph for Xception (b) Training and validation loss 
graph for Xception. 
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Figure 13. Training and validation accuracy graph for VGG16. 
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Figure 14. Training and validation loss graph for VGG16. 


In case of Xception, both the accuracy graph, Figure 
12(a) and the loss graph, Figure 12(b), for the training 
and validation phases, have fluctuating validation curves. 
This indicates that the model is unstable and is learning 
variability. The inconsistent flow in the validation curve 
may indicate that the model is struggling to generalize 
due to overfitting. 

Whereas in the case of VGG16 Figure 13, the "S" 
shape in the validation curve indicates that the model has 
reached its maximum learning capacity and further 
training doesn't significantly improve performance. The 
model is underfitting as it fails to learn the complex 
patterns in the data. 

Even though the validation loss curve is on par with 
the training loss curve for VGG16, as shown in Figure 
14, the graph indicates that the model’s capacity and 
complexity are not enough to learn the patterns in data. 


Conclusion 
The use of deep convolutional neural networks 
(DCNNs) to detect Alzheimer’s disease from MRI scans 
represents an advancement in the domain of medicine. 
These sophisticated CNN structures have demonstrated 
promising outcomes in spotting patterns and irregularities 
in brain images, facilitating the diagnosis of AD. Moving 
forward, the future direction for detecting Alzheimer’s 
disease involves segmenting identified tumours within 
The development of CNN-based 
techniques can aid in pinpointing. 
s-related anomalies 


brain 
segmentation 


scans. 
Characterizing Alzheimer' can 
contribute to a thorough diagnosis. By integrating the 
capabilities of CNNs for detection with segmentation 
approaches, we can improve our understanding of the 
disease progression, potentially leading to more accurate 
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and personalized treatment strategies for individuals with 
Alzheimer’s disease. 
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